MUTATED RECOMBINANT COLLAGENS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This is a continuation of and claims priority under 35 U.S.C. § 120 to U.S. Serial No.
08/278,774, filed July 22, 1994, now U.S. Patent No. 6,653,450, which is a continuation-in-
part of U.S. Serial No. 08/183,648, filed January 18, 1994, now Patent No. 5,667,839, which
is a continuation-in-part of U.S. Serial No. 08/011,643, filed January 28, 1993, abandoned, all
of which are incorporated herein by reference.


Field of the Invention 

The field of this invention is recombinant procollagens containing novel cleavage
sites.

Background of the Invention

Collagen has provided a number of commercial applications including use as a 
convenient substrate for cell culture, as well as in the manufacture of biocompatible materials 
(e.g., artificial skin) having a variety of therapeutic applications in humans. Commercially 
available collagens are primarily isolated from freshly killed farm animals (e.g., cows, pigs,
chickens) in polymerized form or in soluble form derived by enzymatic or chemical treatment
of polymerized collagen. However, polymerized collagens, and soluble collagens derived 
from polymerized collagens, are of limited usefulness due to the presence of natural 
crosslinks within the collagen. Furthermore, non-human collagens can provoke undesirable 
immune responses when administered to human subjects. 

As one alternative, human collagen can be purified from human sources such as
human placenta, as described in U.S. Patent No. 5,002,071 (Research Development
Foundation) and in copending U.S. application Serial No. 07/921,810 (Collagen
Corporation). In addition to source limitations and the risk of contamination by human
pathogens such as hepatitis viruses and HIV, the methods for recovering collagen from these
sources bias the type of collagen recovered. Furthermore, because collagen is derived from
naturally crosslinked tissues, the collagen recovered is not entirely homogeneous. Another
approach to the production of human collagen is the expression of recombinant human
collagen in the milk of transgenic animals, as described in copending U.S. application Serial
Nos. 08/183,648 and 08/011,643 (Collagen Corporation). However, this approach subjects
the recombinant collagen to any host deficiencies in translational processing.

The term collagen refers to a type of protein that encompasses a class of structurally 
related polypeptides consisting of helical collagen chains and homo- and heteromeric
polymers thereof. Collagen chains often contain both long helical domains and non-helical
extensions, or telopeptides. The helical domains comprise -(GXY)n- repeats, where X and/or
Y are frequently proline or hydroxyproline. 

Collagen polypeptide chains are encoded by a number of related genes. See Adams,
S.L. Amer J Respir Cell and Mol Biol (1989) 1:161-168; Mayne and Brewton (1993) Current
Opinion in Cell Biology 5:883-890; van der Rest and Bruckner (1993) Current Opinion in
Structural Biology 3:430-436. Collagen-encoding transcripts are initially translated into
procollagen chains which undergo a variety of post-translational events such as processing,
secretion and assembly (e.g. disulfide exchange, prolyl residue hydroxylation, glycosylation,
and crosslinking of the helical chains; see, for example, Prockop et al. (1984) New England J
Med 311:376-386) to form mature collagen chain helices. In addition, procollagens contain
N- and C-terminal propeptides that facilitate collagen formation. See, for example, Lee et al.
(1992) J Biol Chem 267, 24126-24133.

Many of these post-translational events are catalyzed by enzymes thought to be
expressed only in or by cells which naturally produce collagen. For example, the procollagen
propeptides are cleaved from the procollagens by tissue-specific procollagen N- and C- 
terminal proteases. Accordingly, recombinant production of collagen has been severely 
restricted by the inability of conventional cellular expression systems to efficiently post- 
translationally modify procollagen to form functionally mature collagen. 

Site-specific proteolysis of unrelated fusion proteins is reported by Carter (1990) in
Protein Purification: From Molecular Mechanisms to Large-Scale Processes, p. 181-
193, American Chemical Society Research Symposium #47; Brenner and Fuller (1992)
PNAS 89:922-926; Seeboth, et al. (1992) Appl Microbiol Biotechnol 37:621-625; and Liu
and Miller (1989) WO 89/12678. Chemical cleavage is described in Fontana and Gross
(1986), Chapter 2 of Practical Protein Chemistry - A Handbook, Darbre, Ed.; John Wiley
and Sons, London.

SUMMARY OF THE INVENTION 
The invention provides recombinant procollagen chains having a natural collagen 
polypeptide chain separated from a propeptide by a non-natural site-specific proteolytic agent
recognition site (i.e. the site is not naturally present at that location of a natural procollagen 
chain). The propeptide is often a C-terminal propeptide and, frequently, a natural 
procollagen C-terminal propeptide. Alternatively, the recombinant procollagen chains may 
further comprise a second propeptide separated from the opposite end of the natural collagen 
chain by a second non-natural site-specific proteolytic agent recognition site.

A wide variety of propeptides and site-specific proteolytic agent recognition sites may 
be used. The propeptide is capable of increasing the yield of at least one of secreted
procollagen (trimeric procollagen chains) or collagen monomers (trimeric collagen chains)
from cells expressing procollagen chains comprising the propeptide, as compared with cells 
expressing the same collagen chains alone, without the propeptide. The non-natural site- 
specific proteolytic agent recognition site is generally selected to provide a substrate for a 
convenient site-specific proteolytic agent. The agent is commonly a protease or a chemical
agent which selectively cleaves at accessible and preferentially labile peptide bonds. Where 
two site-specific proteolytic agent recognition sites and two propeptides are present, the 
recognition sites may have the same or different amino acid sequences with the same or 
different site-specific proteolytic agent affinities and specificities. 

The subject procollagen chains may be constructed such that the site-specific
proteolytic agent cleavage site is a peptide bond that separates the collagen chain and/or the
propeptide from the remainder of the procollagen chain. Alternatively, the cleavage site may 
be a peptide bond that separates, from the remainder of the procollagen chain, the collagen 
chain (or propeptide) joined to at least one amino acid which is not naturally present at that
location of a natural collagen chain (or propeptide).

The invention provides collagens produced by contacting the subject recombinant 
procollagen with site-specific proteolytic agents capable of selectively cleaving the 
procollagen chains at the site-specific proteolytic agent recognition sites, wherein the 
resulting chains comprise terminal peptides which may have non-natural amino acid
sequences (i.e. not naturally present at that location of a natural collagen chain). The
resulting collagen may be employed directly (e.g. to coat cell culture dishes or cultureware)
or further polymerized and/or provided in sterile, nontoxic, biocompatible collagen 
compositions suitable for administration to humans. 

The invention also provides nucleic acids encoding the subject procollagen chains, as
well as vectors and cells comprising such nucleic acids operably linked to transcription
regulatory elements not naturally linked to such nucleic acids. The invention provides 
processes for the production of recombinant procollagen and/or collagen chains in and from 
such cells. The invention also provides methods of utilizing the resulting procollagen and 
collagen, for example, coating cultureware and augmenting localized tissue in a host by, for
example, subcutaneous administration.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a three-dimensional computer model of the N-terminal propeptide-
telopeptide region of human procollagen type I (a triple helix of two α1 chains and one α2
chain). The structure was produced on a Silicon Graphics workstation using Biosym
software. The energy minimized structure is produced after manual chain pull-ins to speed
the process of computing the energy minima. Points of cleavage with N-propeptidase are
marked.

Figure 2 shows a three-dimensional computer model of the C-terminal propeptide-
telopeptide region of human procollagen type I (a triple helix of two α1 chains and one α2
chain). The structure was produced on a Silicon Graphics workstation using Biosym
software. The energy minimized structure is produced after manual chain pull-ins to speed
the process of computing the energy minima. Points of cleavage of the extra-helical
peptide from the C-terminal pro-region are marked. In the two α1 chains, this is
before Gly 38; in the α2 chain, it is before Glu 27.

Figure 3 shows a three-dimensional computer model of the N-terminal propeptide-
telopeptide region of a mutant human procollagen type I containing the DDDK insertion.
The structure was produced on a Silicon Graphics workstation using Biosym software. The
energy minimized structure is produced after manual chain pull-ins to speed the process of
computing the energy minima. The insertional loop members are marked.

Figure 4 shows the proteolytic cleavage site of the mutant human procollagen type I
containing the DDDK insertion. Points of cleavage of N-propeptidase are marked.
Regions in which the α1 chains are "coiled" are designated with a "10".

DETAILED DESCRIPTION OF PARTICULAR 
EMBODIMENTS OF THE INVENTION 

The invention provides recombinant procollagen chains comprising a natural collagen 
chain, a propeptide, and a non-natural site-specific proteolytic agent recognition site (i.e. not 
naturally present at that location of a natural collagen chain) located between the collagen 
chain and the propeptide.

The propeptide increases the yield of at least one of secreted procollagen (trimeric 
procollagen chains) or collagen monomers (trimeric collagen chains) recoverable from cells 
expressing the subject recombinant procollagen chains, as compared with cells expressing the 
collagen chains alone, without the propeptide. Generally, the yield is enhanced by at least 
50%, preferably at least about 100%, more preferably at least about 200%, and most 
preferably at least about 1,000% in at least one expression system. 
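The yield comparison described above reduces to simple arithmetic. The following is a minimal sketch, assuming yields are measured in the same units with and without the propeptide; the function names and tier labels are hypothetical and merely mirror the thresholds stated in the preceding paragraph.

```python
def yield_enhancement_percent(yield_with: float, yield_without: float) -> float:
    """Percent increase in recoverable (pro)collagen attributable to the propeptide."""
    if yield_without <= 0:
        raise ValueError("baseline yield must be positive")
    return 100.0 * (yield_with - yield_without) / yield_without


def enhancement_tier(percent: float) -> str:
    """Map a percent enhancement onto the thresholds named in the text (labels are illustrative)."""
    if percent >= 1000:
        return "most preferred"
    if percent >= 200:
        return "more preferred"
    if percent >= 100:
        return "preferred"
    if percent >= 50:
        return "general"
    return "below threshold"


# Hypothetical example: 3.0 mg/L with the propeptide versus 1.0 mg/L without.
gain = yield_enhancement_percent(3.0, 1.0)
print(gain, enhancement_tier(gain))
```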

Collagen refers to a variety of helical arrays of collagen chains and includes both 
polymerized and monomeric collagen. The propeptides are believed to improve collagen
and/or procollagen yields by facilitating or increasing at least one of proper expression, 
processing, secretion, or polymerization of the procollagen chain, the collagen chain, and/or a 
collagen monomer. The function of the propeptide is readily monitored by comparing the
yields of recoverable collagen and/or procollagen with and without the presence of the
propeptide in in vitro or in vivo expression systems including lysates, transformed or 
transfected cells (such as mammalian cells or yeast), transgenic animals, etc. Specific 
examples of assays for propeptides are described below. 
The propeptide may take a wide variety of forms and sizes: it may be a natural
procollagen propeptide, or a propeptide not naturally present as a propeptide in a procollagen
chain, as long as it provides the procollagen with the requisite collagen yield. Generally, 
propeptides are at least 10, preferably at least about 25, more preferably at least about 75, and 
most preferably at least about 200 amino acids in length. The propeptide is generally located 
at the C-terminus or N-terminus of the procollagen chain, or the procollagen chain may have
propeptides at both termini. Where the recombinant procollagen chains contain both C- and 
N-terminal propeptides, the chains often comprise a second (and often non-natural) site- 
specific proteolytic agent recognition site located between the collagen chain and the second 
propeptide. 

Preferred procollagen chains comprise at least a C-terminal propeptide, frequently a
natural C-terminal portion of a protein with collagen-like repeats and/or a helical, especially
triple-helical, polymeric form, such as complement protein C1q-C, acetylcholine esterase, or
lung surfactant protein. Especially preferred are natural procollagen C-terminal propeptides, 
such as collagen Type I, II, III, IV, V and XI C-terminal propeptides. Preferred N-terminal
propeptides comprise at least one globular domain and often consist essentially of a single
globular domain. Especially preferred are natural procollagen N-terminal propeptides, such 
as collagen type I, II, III, IV, V, and XI N-terminal propeptides. 

After cleavage from the collagen chain, the propeptides are preferably purified out of 
the collagen composition. The propeptides are preferably selected such that, if small
quantities of the propeptides remain in the collagen composition after processing, the residual
propeptides do not interfere with the intended application of the resulting composition. For 
example, for in vivo use, the composition should not contain contaminants present in an 
amount sufficient to cause a toxic or immunogenic response or reaction in the patient. 

A wide variety of site-specific proteolytic agent recognition sites may be used. The
recognition site is non-natural to a natural procollagen chain, meaning that the site occupies a
portion of the procollagen chain not naturally occupied by the recognition site. Frequently, 
the site is not naturally present anywhere in the natural counterpart of the procollagen chain. 
The site generally comprises one or more amino acid residues or derivatives thereof. The 
recognition site may comprise a modified (e.g., methylated) residue, especially for some
chemical proteolytic agents. These sites are selected to provide substrates for at least one
convenient site-specific proteolytic agent, preferably a commercially available site-specific 
proteolytic agent. The precise site selection will depend on the selected expression system,
the selected propeptide, and the availability of the site-specific proteolytic agent. For
example, the site should provide a highly selective target for the selected site-specific 
proteolytic agent, such that the target site (or both target sites) is cleaved while minimizing 
hydrolysis of peptide bonds within the collagen chain. 
A large number of site-specific proteolytic agents and their recognition sites are
known and/or commercially available. See, for example, Carter (1990, supra), Saito et al.
(1987) J. Biochem 101, 123-134, etc. For example, a wide variety of chemical cleavage agents
and their recognition sites are known (the arrow marks the cleaved bond), e.g.
hydroxylamine: N↓G; formic acid: D↓P; acetic acid: D↓P; cyanogen bromide: M↓;
BNPS-skatole: W↓; o-iodosobenzoic acid: W↓; N-chlorosuccinimide: W↓; etc.
Alternatively, natural or synthetic proteases (e.g. catalytic
antibodies, etc.) may be used. Examples of proteases and their recognition sites include:
chymotrypsin: W↓ and Y↓ and F↓; collagenase: P-X↓G-P; endoproteinase Lys-C: K↓;
enterokinase: D-D-D-K↓ (SEQ ID NO:1:); Factor Xa: I-E-G-R↓ (SEQ ID NO:2:); kallikrein:
P-F-R↓; renin: Y-I-H-P-F-H-L↓L (SEQ ID NO:3:); H64A subtilisin BPN: A-A-H-Y↓ (SEQ
ID NO:4:); thrombin: R-G-P-R↓ (SEQ ID NO:5:); trypsin: R↓ and K↓; ubiquitin protein
peptidase: ubiquitin↓; etc. A wide variety of methods may be used to increase the efficiency
of cleavage at a target site (e.g. the provision of flanking poly-glycine peptides to enhance 
accessibility) and/or to reduce non-target site cleavage (e.g. optimizing concentration ratios 
and reaction conditions). 
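As an illustration of how a candidate junction might be screened for target and non-target sites, the following minimal sketch scans a one-letter amino acid sequence for several of the protease motifs listed above. It assumes, for simplicity, that cleavage follows the last residue of each motif; the dictionary, the function names, and the example junction sequence are hypothetical and are not part of the disclosed methods.

```python
import re
from typing import Dict, List

# Motifs taken from the list above (one-letter codes).
# Assumption for this sketch: the cleaved bond follows the last residue of the motif.
RECOGNITION_SITES: Dict[str, str] = {
    "enterokinase": "DDDK",
    "Factor Xa": "IEGR",
    "kallikrein": "PFR",
    "H64A subtilisin BPN": "AAHY",
    "thrombin": "RGPR",
}


def cleavage_positions(sequence: str, motif: str) -> List[int]:
    """Return 1-based positions of the residues after which cleavage would occur."""
    pattern = re.compile(f"(?={re.escape(motif)})")  # lookahead also finds overlapping hits
    return [m.start() + len(motif) for m in pattern.finditer(sequence)]


def scan_all(sequence: str) -> Dict[str, List[int]]:
    """Report predicted cleavage positions for every agent in RECOGNITION_SITES."""
    return {agent: cleavage_positions(sequence, motif)
            for agent, motif in RECOGNITION_SITES.items()}


# Hypothetical engineered junction: propeptide end, DDDK insert, telopeptide start.
junction = "GGNFAPDDDKQLSYGYDEKSTGGISVP"
for agent, positions in scan_all(junction).items():
    print(agent, positions)
```

A scan of this kind only flags sequence matches; accessibility of the site in the folded trimer still has to be assessed separately.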

Preferred site/site-specific proteolytic agent combinations yield a collagen product
that is conveniently adapted to the ultimate intended application. For example, any residual
non-natural terminal residues should not interfere with and, preferably, enhance the 
performance (e.g. the promotion of cell growth or adherence to coated cultureware) of the 
resulting collagen product vis-a-vis the natural collagen counterpart. For in vivo use, the
resulting collagen should not be immunologically reactive in the host from which the
collagen chain is derived. Generally, immunological non-reactivity is effected by 
minimizing the number of non-collagen chain amino acid residues which remain joined to the 
collagen chain after site-specific proteolytic agent cleavage. Preferred site/site-specific 
proteolytic agent combinations yield collagen chains with fewer than 75, preferably fewer
than 25, more preferably fewer than 10, most preferably fewer than 5 and, especially, zero,
non-natural collagen chain terminal amino acid residues. Hence, preferred sites and site- 
specific proteolytic agents provide cleavage at or near the junction with natural collagen 
chain residues. 

Where two site-specific proteolytic agent recognition sites and two propeptides are 
present, the site-specific proteolytic agent recognition sites may have the same or different
amino acid sequences, with the same or different site-specific proteolytic agent affinities and 
specificities. The site-specific proteolytic agent recognition site(s) generally abut, but need not
abut, both the collagen chain and respective propeptide; i.e. the subject procollagen chains
may also have a natural or non-natural (i.e. not naturally present at that location of a natural
procollagen chain) amino acid sequence between the site-specific proteolytic agent
recognition site and the collagen chain, and/or between the site-specific proteolytic agent
recognition site and the propeptide. Accordingly, depending on the selected recognition site,
site-specific proteolytic agent, and presence or absence of residues separating the cleavage 
site from the collagen chain, the resulting (after cleavage) recombinant collagen chains may 
comprise one or both terminal peptides having a non-natural amino acid sequence. By way 
of example: 

for human α1(I) collagen, a preferred N-terminal peptide is NH2-
X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15QLSYGYDEKSTGGISVP (SEQ ID NO:6:),
where "X1-X15" is other than PGPPGPPGLGGNFAP (SEQ ID NO:7:); and a preferred C-
terminal peptide is PPQEKAHDGGRYYRA (SEQ ID NO:8:)
X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15, where "X1-X15" is other than
DDANVVRDRDLEVDT (SEQ ID NO:9:);

for human α2(I) collagen, a preferred N-terminal peptide is NH2-
X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15QYDGKGVGLGP (SEQ ID NO:10:), where "X1-
X15" is other than PGPPGPPGLGGNFAA (SEQ ID NO:11:); and a preferred C-terminal
peptide is GGGYDFGYDGDFYRA (SEQ ID
NO:12:) X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15, where "X1-X15" is other than
DQPRSAPSLRPKDYE (SEQ ID NO:13:);

for human α1(II) collagen, a preferred N-terminal peptide is NH2-
X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15QMAGGFDEKAGGAQMGVQM (SEQ ID
NO:14:), where "X1-X15" is other than PGPPGPPGLGGNFAA (SEQ ID NO:15:); and a
preferred C-terminal peptide is GPREKGPDPMQYMRA (SEQ ID
NO:16:) X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15, where "X1-X15" is other than
DEADSTLRQHDVEVD (SEQ ID NO:17:);

for human α1(III) collagen, a preferred N-terminal peptide is NH2-
X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15QYDSYDVKSGVAVG (SEQ ID NO:18:), where
"X1-X15" is other than GICESCPTGPQNYSP (SEQ ID NO:19:); and a preferred C-terminal
peptide is GIGGEKAGGFAPYYG (SEQ ID
NO:20:) X1X2X3X4X5X6X7X8X9X10X11X12X13X14X15, where "X1-X15" is other than
DEPMDFKINTDEIMT (SEQ ID NO:21:).
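The "X1-X15 is other than" condition in the examples above can be expressed as a simple check. The sketch below is illustrative only: it assumes one-letter sequences, uses the human α1(I) N-terminal values quoted above, and the function name and candidate peptide are hypothetical.

```python
def is_preferred_n_terminal_peptide(peptide: str,
                                    natural_x1_x15: str,
                                    fixed_tail: str) -> bool:
    """True if the peptide ends in the fixed telopeptide tail and its preceding
    15-residue stretch (X1-X15) differs from the natural propeptide sequence."""
    variable_len = len(natural_x1_x15)
    if len(peptide) != variable_len + len(fixed_tail):
        return False
    head, tail = peptide[:variable_len], peptide[variable_len:]
    return tail == fixed_tail and head != natural_x1_x15


# Values for human alpha-1(I) collagen as quoted in the text.
NATURAL_X1_X15 = "PGPPGPPGLGGNFAP"      # SEQ ID NO:7
FIXED_TAIL = "QLSYGYDEKSTGGISVP"        # fixed portion of SEQ ID NO:6

# Hypothetical candidate carrying a DDDK insertion immediately before the tail.
candidate = "GPPGLGGNFAPDDDK" + FIXED_TAIL
natural = NATURAL_X1_X15 + FIXED_TAIL

print(is_preferred_n_terminal_peptide(candidate, NATURAL_X1_X15, FIXED_TAIL))
print(is_preferred_n_terminal_peptide(natural, NATURAL_X1_X15, FIXED_TAIL))
```

The same comparison applies to the C-terminal peptides if the fixed portion is checked as a prefix rather than a suffix.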

The invention provides recombinant collagen chains produced by contacting the 
subject recombinant procollagen chains with site-specific proteolytic agents capable of
selectively cleaving the procollagen chain at site-specific proteolytic agent recognition sites.
Useful collagen compositions produced by such methods may retain residual site-specific
proteolytic agents, wherein the site-specific proteolytic agents are present at concentrations
that may exceed 1 picogram, frequently exceed 1 nanogram, and more frequently exceed 1
microgram per gram of collagen protein. Resulting collagen compositions may be subjected
to conditions known in the art to be conducive to higher-order polymerization of the
collagen, for example, fibril formation. Monomeric or polymeric collagen compositions may
be further subjected to a wide variety of manipulations depending on the intended
application. For example, the collagen may be diluted in a solvent for optimal adsorption to
solid substrates (e.g. glass or plastics such as polystyrene, polyethylene, polypropylene,
acetate, nylon, etc.) useful in tissue or cell culture, or purified and sterilized (e.g. filter
sterilization) for use in vivo, such as for tissue augmentation.

The invention provides nucleic acids encoding the subject recombinant procollagen 
chains. These nucleic acids may have a wide variety of forms and sizes and frequently 
comprise non-coding sequences which facilitate transcription of the encoded procollagen 
chains. Constructs comprising such nucleic acids operably linked to transcription regulatory
elements are spliced into replication and/or transfection vectors, which are then used to
transfect/transform cells. 

Nucleic acids encoding the collagen chain portions (e.g. col1A1, col1A2 and col3A1)
and the site-specific proteolytic agent recognition portions of the subject nucleic acids are
readily available, for example, through published sequences, including electronic databases
(e.g., Genbank, the National Center for Biotechnology Information at the National Library
of Medicine, etc.), through commercial vendors, through library hybridization with
otherwise available collagen-encoding nucleic acids, or through known molecular cloning
techniques. Materials and methods for constructing (e.g., recombinantly or synthetically) the
subject procollagen-encoding nucleic acids, expression vectors, transformed and transfected
cells, processes for producing a recombinant procollagen chain (including selection from a
wide variety of cells, e.g., cultured mammalian cells such as fibroblasts, CHO cells, yeast,
mammary gland cells of transgenic mammals, etc.), transfecting/transforming and culturing
the cells under conditions suitable for the expression of the nucleic acid, and recovering and
purifying recombinant procollagen chains are disclosed herein or otherwise known in the art.
See, for example, Molecular Cloning, A Laboratory Manual (2nd Ed., Sambrook, Fritsch and
Maniatis, Cold Spring Harbor), Current Protocols in Molecular Biology (Eds. Ausubel, Brent,
Kingston, Moore, Seidman, Smith and Struhl, Greene Publ. Assoc., Wiley-Interscience, NY,
NY, 1992).

In a preferred embodiment, construct design is effected by first preparing a three-
dimensional model of a procollagen helix at the propeptide-telopeptide junction, then
introducing amino acid substitutions that would allow cleavage with specific proteolytic
enzymes. For example, Figures 1 and 2 depict the structures of the N- and C- terminal
propeptide-telopeptide regions of human type I procollagen, respectively. The structures are
energy minimized and the native cleavage sites are marked. Molecular modeling is 
performed to ensure that the substitutions do not disrupt the folding of this region so as to 
inhibit processing, secretion, helical formation, or site-specific proteolytic agent accessibility. 

Molecular constructs are then generated by either mutating procollagen-encoding 
cDNAs or splicing nucleic acids encoding the selected collagen chain with nucleic acids 
encoding the selected site-specific proteolytic agent recognition site. Mutation may be 
effected by site-directed mutagenesis, nucleotide insertion or substitution (e.g., by PCR-based 
mutagenesis) or replacement of oligonucleotides by cloning. The insertion site is preferably 
at or within 100, preferably within 30, more preferably within 10 nucleotides distal to the
procollagen site-specific proteolytic agent cleavage site encoding sequence. The substituted
nucleotides are within 60, preferably within 36, more preferably within 18, nucleotides distal to the
procollagen site-specific proteolytic agent cleavage site encoding sequence. Preferred
constructs additionally comprise nucleic acids including a promoter region necessary for the 
expression of the encoded procollagen in an appropriate host. Preferred promoters that are 
constitutive in cultured cells include cytomegalovirus (CMV), simian virus (SV), and Rous 
sarcoma virus (RSV) promoters. Preferred promoters for tissue-specific expression (e.g., in 
mammary gland cells) of transgenic animals include casein or whey acidic protein promoters.
Preferred promoters for expression in yeast include phosphoglycerate kinase (PGK), alcohol
dehydrogenase (ADH), and galactose (GAL) promoters.

The construct is then transfected into and expressed in a cellular expression system, 
preferably an expression system wherein the mutated procollagen is secreted. In less 
preferred embodiments, the recombinant procollagen is not sufficiently secreted, and
recovery entails lysing the cells and centrifuging the lysate to remove cellular debris. In one 
embodiment, the cells are selected or engineered to express both the recombinant procollagen 
and a site-specific proteolytic agent capable of specifically cleaving the procollagen at the 
recognition site. Alternatively, the appropriate site-specific proteolytic agents are introduced 
in an amount sufficient to cleave the propeptides from the procollagen molecules prior to 
fibrillogenesis in the processing steps. 

Collagen chains are purified from the medium by methods known in the art. See, for 
example, Miller and Rhodes (1982) Methods in Enzymology 82:33-64; Sage and Bernstein 
(1982) Methods in Enzymology 82:96-127. Resulting collagen compositions may be 
sterilized, for example, by passage through 0.2 μm filters (e.g., Millipore filters). Sterility
may be confirmed by, for example, culturing aliquots on nutrient agar for at least 72 hours at 
25 and 37°C. For therapeutic uses, it is desirable to ensure the non-toxicity, biocompatibility 
and non-immunogenicity of the resulting recombinant collagen compositions. For example, 
non-immunogenicity in humans is confirmed by dermal sensitivity testing.

The invention provides a wide variety of applications of the subject recombinant
procollagens and collagens in tissue and cell culture and therapy. For example, the subject 
procollagens and collagens are used to promote adhesion and growth of cells on a wide 
variety of solid substrates in various forms, such as slides, filaments, sheets, plates, flasks, 
bottles, fibers, etc., and compositions such as plastics, glass, metals, saccharide-based
polymers, etc. The collagens provide a wide range of therapeutic applications such as use in
biodegradable sutures, synthetic dermal skin, biodegradable hemostatic sponges, tissue 
augmentation (including modifying localized soft tissue appearance), etc. 

Tissue augmentation using collagen compositions has been thoroughly studied,
documented, and implemented in widespread clinical settings. See, for example, DeLustro et
al. (1986) J Biomed Mater Res, 20, 109-120; Elson ML (1989) J Dermatol Surg Oncol, 15, 
301-303; Elson ML (1992) Am J Cosmetic Surg 9, 267-271; Klein et al. (1985) J Dermatol 
Surg Oncol, 11, 337-339; Klein et al. (1989) Geriatric Dermatology: Clinical Diagnosis and
Practical Therapy, chapter 8, pp 47-49, Igaku-Shoin (publ) New York; Kligman AM (1988) J
Dermatol Surg Oncol 14 (suppl I), 35-38; Kligman et al. (1986) J Dermatol Surg Oncol, 12,
351-357; Kligman et al. (1988) J Dermatol Surg Oncol, 14 (suppl I), 10-12; Knapp et al. 
(1977) Plas Reconstr Surg, 60, 398-405; Knapp et al. (1977) J Surg Res, 23, 96-105; 
Varnavides et al. (1987) Br J Dermatol 116, 199-206; etc.


EXAMPLES 

The following examples are offered by way of illustration and not by way of 
limitation. 

Example 1 

(Three-Dimensional Computer Modeling of Single Site-Specific
Protease Recognition Site Insertion)

Three-dimensional models for the N- and C- terminal propeptide-telopeptide regions
were prepared as outlined above. Figures 1 and 2 depict the energy minimized structures of
the N- and C- terminal propeptide/telopeptide regions of human type I procollagen,
respectively. Substitutions were then inserted at the N-terminal cleavage junction site to
generate a non-natural site-specific proteolytic agent recognition site and new models
prepared to determine their effects on procollagen folding.

Figure 3 depicts an N-terminal fragment containing a DDDK (SEQ ID NO: 1:) 
insertion in all three chains, introducing a recognition and cleavage site for enterokinase into 
the propeptide-telopeptide junction region. As shown in Figure 4, the proteolytic cleavage
site of this mutant is particularly accessible in the energy-minimized structure.

Example 2

(Generation of Molecular Constructs for Recombinant Human Procollagens) 

A. cDNA Clones Encoding Human α1(I) and α2(I) Procollagens

A cDNA clone containing the entire coding region of human α1(I) procollagen is
obtained from the Whitehead Institute (Cambridge, MA). The cDNA is
described in Stacy, et al., J. Virol. 61:2549 (1987). The nucleotide sequence of this clone
(pHUC) is described in Tromp, et al., Biochem. J. 253:919 (1988), and also under Accession
No. X07884 (Genbank, Los Alamos, NM). A cDNA clone (pSTL14) containing the entire
coding region of human α2(I) procollagen is described in Lee, et al., J. Biol. Chem.
263:13414 (1988). The nucleotide sequence of human α2(I) procollagen is listed under
Accession No. J03464 (Genbank), and the numbering system used assigns nt 1 to the
transcription start point, according to Dickson et al., Nucl. Acids Res. 13:3427 (1985).

B. Introduction of Cleavage Site into the Junction of N-Propeptide and N-Telopeptide of
the Human α1(I) Procollagen cDNA

A specialized cloning vector (pPLl) is created by isolating the Afllll/Sspl region of 
Bluescript II KSM13+ from nucleotide position 1 153 to 2850 (Stratagene Cloning Systems, 
La Jolla, CA) and inserting a double-stranded deoxyoligonucleotide polylinker containing the 
restriction sites (in order) Hindlll, PvuII, Dralll, and BamHI. Using pHUC as a template, a 

20 618 nt region of the 5' end of the cDNA is amplified by the Polymerase Chain Reaction 

(Perkin Elmer Cetus, Norwalk, CT, according to the manufacturer's instructions) using the 
oligodeoxynucleotide primer #1 ACGCCAAGCTTGGCTGCAGGTCGA (SEQ ID NO:22:) 
and primer #2 pCTG TTTATCATCGTC GGGAGCAAAGTTTCCTCC (SEQ ID NO:23:). 
The underlined nucleotide sequence in primer #2 codes for the amino acid recognition site 

25 (DDDK) (SEQ ID NO: 1 :) for the proteolytic enzyme enterokinase. The 6 10 nt DNA 

fragment generated by the PCR reaction is cleaved with the restriction enzyme Hindlll, and 
cloned into pPLl at the Hindlll/PvuII sites, creating pPLl-1. Next, the PvuII/Dralll 
fragment from the human al(I) procollagen cDNA (nt 858 to nt 1797) is cloned into pPLl-1 
at the PvuII/Dralll sites, creating pPLl -2. The next step is to insert the PvuII/PvuII (nt 569 

30 to nt858) fragment of the human otl(I) procollagen cDNA into pPLl-2 at the PvuII site, 

creating pPLl -3. This PvuII/PvuII fragment inserts itself into the vector in either orientation, 
so clones which contain the orientation identical to the human a 1(1) procollagen cDNA are 
identified by DNA sequence analysis following the manufacturer's protocol (using 
Sequenase™ DNA sequencing kit by U.S. Biochemicals, Cleveland, OH, and the 

35 deoxynucleotide GAGTCACCCACCGACCAAGA (SEQ ID NO:24:) as the primer). The 
cDNA fragment from the human a 1(1) procollagen cDNA from the Dralll site at position 
1797 to the BamHI site at position 2886 is cloned into pPLl-3 at the Dralll/BamHI site, 
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creating pPLl-4. The product of these steps is the continuous region of the human a 1(1) 
procollagen cDNA from the 5' end of the cDNA to the BamHI site at position 2886 with the 
introduction of a coding region for the amino acids DDDK, which is the amino acid 
recognition site for enterokinase. 
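
The relationship between primer #2 and the DDDK site can be checked with a short sketch. It assumes that the middle block of primer #2 as printed (TTTATCATCGTC) is the underlined region, that primer #2 is the antisense primer so that the coding strand is its reverse complement, and that the standard genetic code applies. The function and variable names are illustrative only and do not come from the specification.

```python
from itertools import product

# Standard genetic code, built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate from the first base, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Primer #2 as printed (5' phosphate "p" omitted); the three blocks follow
# the spacing in the text. Treating the middle block as the underlined
# region is an assumption.
PRIMER_2 = "CTG" "TTTATCATCGTC" "GGGAGCAAAGTTTCCTCC"
SITE_BLOCK = "TTTATCATCGTC"

site_coding = reverse_complement(SITE_BLOCK)
primer_coding = reverse_complement(PRIMER_2)

print(site_coding, translate(site_coding))
print(primer_coding, translate(primer_coding))
```

Under these assumptions the middle block reads DDDK on the coding strand, and the whole primer reads GGNFAPDDDKQ, which places the enterokinase site immediately upstream of the residue that follows it in the primer.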

C. Introduction of Cleavage Site into the Junction of C-Telopeptide and C-Propeptide of 
the Human α1(I) Procollagen cDNA 

Another specialized cloning vector (pPL2) is created following the procedure 
described above, but using a polylinker which contains the recognition sites for the restriction 
enzymes (in order) SalI, HindIII, BamHI, BspEI, and SspI. First, a fragment of the human 
α1(I) procollagen cDNA from the BamHI site at position 2886 to the BspEI site at position 
3817 is cloned into the vector at the BamHI/BspEI site, creating pPL2-1. pPL2-1 is digested 
with the restriction enzymes Eco47III and XhoI, and the 106 bp fragment is discarded. A 
double-stranded deoxyoligonucleotide with the sequence 
GCTGGTTTCGACTTCAGCTTCCTCCCCCAGCCACCTCAAGAGAAGGCTCACGATGGTGGC 
CGCTACTACCGGGCTTGGGATGATGCCAATGTGGTTCGTGACCGTGACC (SEQ ID 
NO:25:) is inserted into pPL2-1 at the Eco47III/XhoI site, creating pPL2-2. This 
double-stranded deoxyoligonucleotide contains three additional deoxynucleotides (TGG) (as 
compared to the human α1(I) procollagen cDNA in this region) which code for the amino 
acid tryptophan. Next, a fragment of the human α1(I) procollagen cDNA from nt 3818 to 
4611 is excised with the restriction enzymes BspEI and SspI and inserted into pPL2-2 at the 
BspEI/SspI site, creating pPL2-3. 
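
A brief sketch can locate the added tryptophan codon in the oligonucleotide of SEQ ID NO:25. It assumes that the spaces and line breaks in the printed sequence are layout only, that the reading frame starts at the first printed nucleotide, and that the standard genetic code applies. The names used are illustrative.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# The printed strand of SEQ ID NO:25, joined across the printed breaks.
OLIGO = (
    "GCTGGTTTCGACTTC" "AGCTTCCTCCC"
    "CCAGCCACCTCAAGAGAAGGCTCACGATGGTGGC"
    "CGCTACTACCGGGCTTGGGATGATGCCAATGTGGTTCGTGACCGTGACC"
)


def translate(seq):
    """Translate from the first base, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


protein = translate(OLIGO)
trp_index = protein.index("W")  # zero-based position of tryptophan
trp_codon = OLIGO[3 * trp_index:3 * trp_index + 3]

# Remove the tryptophan codon to see the flanking sequence without it.
without_trp = OLIGO[:3 * trp_index] + OLIGO[3 * trp_index + 3:]

print(len(OLIGO), protein)
print("Trp at residue", trp_index + 1, "codon", trp_codon)
print(translate(without_trp))
```

In this frame the single TGG codon falls between an alanine and an aspartate codon, and the printed strand is three nucleotides longer than the 106 bp fragment it replaces, which agrees with the statement that three deoxynucleotides are added.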

D. Assembly of Human α1(I) Procollagen cDNA with Non-Natural Cleavage Sites at 
the Propeptide-Telopeptide Junctions 

The human α1(I) procollagen cDNA containing additional nucleotide sequences 
within the cDNA, which code for additional amino acid(s) at the N-terminal 
propeptide-telopeptide and C-terminal telopeptide-propeptide junctions, is created by removal 
of the HindIII/BamHI fragment from pPL1-4 containing the 5' region of the human α1(I) 
procollagen cDNA, as described in section B above, and inserting this fragment into the 
HindIII/BamHI sites of pPL2-3 containing the 3' region of the human α1(I) procollagen 
cDNA, creating pM α1(I). This results in an α1(I) procollagen cDNA that codes for a 
polypeptide which can be cleaved at the N-terminal propeptide-telopeptide junction with 
enterokinase, leaving no additional terminal amino acids, and at the C-terminal 
telopeptide-propeptide junction with N-chlorosuccinimide, leaving the terminal amino acid 
tryptophan, according to the method described in Saito et al., J. Biochem. 101:123 (1987). 
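
As a bookkeeping aid, the restriction fragments cited in sections B and C can be listed by their stated nucleotide coordinates and checked for contiguity. This is a minimal sketch: the 5' PCR product is omitted because its end coordinates are not given, a one-nucleotide tolerance is assumed because the text cites the BspEI site at 3817 and the following fragment from nt 3818, and the inserted cleavage-site codons are not counted.

```python
# (fragment name, start nt, end nt) in 5'-to-3' order, as cited in sections B and C.
FRAGMENTS = [
    ("PvuII/PvuII", 569, 858),
    ("PvuII/DraIII", 858, 1797),
    ("DraIII/BamHI", 1797, 2886),
    ("BamHI/BspEI", 2886, 3817),
    ("BspEI/SspI", 3818, 4611),
]


def find_gaps(fragments, tolerance=1):
    """Return neighbouring fragment pairs whose ends differ by more than tolerance."""
    gaps = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(fragments, fragments[1:]):
        if abs(start_b - end_a) > tolerance:
            gaps.append((name_a, name_b, end_a, start_b))
    return gaps


gaps = find_gaps(FRAGMENTS)
span = (FRAGMENTS[0][1], FRAGMENTS[-1][2])

print("gaps:", gaps)
print("covered span: nt %d to nt %d" % span)
```

An empty gap list indicates only that the cited coordinates tile the cDNA without a break; it says nothing about fragment orientation, which the text resolves by sequencing.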
E. Use of Heterologous N-Propeptide and Introduction of Cleavage Site into the 
Junction of N-Propeptide and N-Telopeptide of the Human α2(I) Procollagen cDNA 

A specialized cloning vector (pPL3) is created as described in section B, but using a 
polylinker which contains the restriction sites (in order) SalI, SphI, PmlI, MscI, and 
Sse8387I. The human α2(I) procollagen cDNA is excised from pSTL14 using SphI and 
SmaI restriction enzymes. The resulting DNA fragment is treated with T4 DNA polymerase 
in the presence of deoxynucleotides to make a blunt end at the 5' end of the cDNA. This 
fragment is subcloned into the Bluescript II vector at the EcoRV and SmaI sites. The α2(I) 
procollagen cDNA fragment can insert into the vector in either orientation, so clones in 
which the SmaI site is reconstituted are selected and designated pα2(I). Polymerase 
Chain Reaction is performed according to the manufacturer's instructions (Perkin Elmer 
Cetus, Norwalk, CT) using Saccharomyces cerevisiae genomic DNA as template and the 
phosphorylated deoxyoligonucleotides 
pGTGGTATGCAAATTTTTGTCAAGACTTTGACTG (SEQ ID NO:26:) and 
pCCACCTCTCAATCTCAAGACT (SEQ ID NO:27:) as primers to isolate a fragment of the 
yeast gene UBI1 (ubiquitin) described in Oezkaynak et al., EMBO J. 6:1429 and listed under 
Accession No. X05728 (Genbank). The 232 bp DNA fragment is subcloned into the vector 
at the PmlI/MscI site, creating pPL3-1. This fragment can insert into the vector in either 
orientation, so clones which contain the orientation identical to that of the human α2(I) 
procollagen cDNA are identified by digestion with the restriction enzymes PmlI and MscI. A 
second PCR reaction is performed using pα2(I) DNA as the template and the two 
deoxyoligonucleotides pCCAGTATGATGGAAAAGGAGTT (SEQ ID NO:28:) and 
CGAGCACCTGCAGGACCAGTT (SEQ ID NO:29:) as the primers for this reaction. The 
159 bp DNA fragment is digested with Sse8387I and subcloned into pPL3-1 at the 
MscI/Sse8387I sites, creating pPL3-2. pα2(I) is digested with SalI and PmlI, and the 199 nt 
fragment is isolated and inserted into pPL3-2 at the SalI and PmlI sites, creating pPL3-3. 
pα2(I) is digested with SalI and Sse8387I, and the 475 bp fragment is discarded. pPL3-3 is 
digested with SalI and Sse8387I, and the 600 bp DNA fragment is isolated and cloned into 
pα2(I) at the SalI and Sse8387I sites, creating pPL3-4. This creates a human α2(I) procollagen 
cDNA which contains the endogenous signal sequence but codes for the yeast ubiquitin 
polypeptide in place of the endogenous N-propeptide. The human α2(I) procollagen 
can be cleaved at the N-propeptide/N-telopeptide junction with a ubiquitin-specific 
processing protease, as described in Tobias and Varshavsky, J. Biol. Chem. 266:12021 
(1991). 

F. Use of Heterologous C-Propeptide and Introduction of Cleavage Site into the 
Junction of C-Telopeptide and C-Propeptide of the Human α2(I) Procollagen cDNA 

pα2(I) is digested with AvrII and EcoRI, and the 909 bp DNA fragment is discarded. 
Next, Polymerase Chain Reaction is performed according to the manufacturer's instructions 
using pα2(I) as template and the deoxyoligonucleotides 
TGGTCCTAGGGGCCCTGCTGGTCCTT (SEQ ID NO:30:) and 
pTCTACCTTCTATAGCCCTGTAGAAGTCTCCATCGTA (SEQ ID NO:31:) as primers. 
The underlined nucleotide sequence in the second primer codes for the amino acid 
recognition site (IEGR) (SEQ ID NO:2:) for Factor Xa. A DNA product of 220 bp is isolated 
and then digested with AvrII. A second PCR reaction is performed using RT-PCR according 
to the manufacturer's instructions (Perkin Elmer Cetus, Norwalk, CT) using as template total 
RNA isolated from mouse spleen by the procedure of Han, et al., Biochem. 26:1617 (1987). 
Random hexamers are used as primers for the reverse transcription of mouse RNA. The two 
PCR primers in the PCR reaction are pAAACAGAAGCACCAGTCGGTATTCA (SEQ ID 
NO:32:) and ATATGAATTCTCAGGTGCACGCACAGGTTGGCCGTA (SEQ ID NO:33:). 
A DNA product of 215 bp which encodes the C-terminus of mouse complement C1q-C 
subunit (see Petry et al., Eur. J. Biochem. (1992) 209:129 and Genbank Accession No. 
X66295) is isolated and digested with EcoRI. The two digested PCR products are ligated 
into pα2(I) previously digested with AvrII and EcoRI as described above, creating pPL4-1. 
These steps create a human α2(I) procollagen cDNA having the mouse complement C1q-C 
subunit C-terminus, which replaces the human α2(I) procollagen C-propeptide and 
introduces an IEGR (SEQ ID NO:2:) recognition site for the protease Factor Xa at the junction 
of the C-telopeptide and C-propeptide. 

G. Assembly of Human α2(I) Procollagen cDNA with Non-Natural Propeptide Domains 
and Non-Natural Cleavage Sites at the Propeptide-Telopeptide Junctions 

The human α2(I) procollagen cDNA containing ubiquitin in place of the endogenous 
N-propeptide and the C-terminus of complement C1q-C subunit instead of the endogenous 
C-propeptide is created by removing the 2850 bp XhoI/XhoI fragment from pPL3-4 (described 
in section E) and inserting this DNA fragment into pPL4-1 (described in section F) that has 
been digested with XhoI and from which the 2725 bp fragment has been discarded, creating 
pM α2(I). The XhoI fragment from pPL3-4 can insert into pPL4-1 in either orientation. The 
orientation which places the translation start codon toward the 5' end of the construction is 
selected using DNA sequence analysis. This results in an α2(I) procollagen cDNA which can 
be cleaved at the N-terminal propeptide-telopeptide junction with ubiquitin-specific 
processing protease, and at the C-terminal telopeptide-propeptide junction with Factor Xa, 
leaving the terminal amino acids IEGR (SEQ ID NO:2:). 

Example 3 
(Production of Recombinant Human Collagen) 
pSV-SPORT1 (Life Technologies, Inc., Gaithersburg, MD) is used as the expression 
vector for the mutated α1(I) procollagen cDNA (pM α1(I) from Example 2D) and the mutated 
α2(I) procollagen cDNA (pM α2(I) from Example 2G). pSV-SPORT1 is digested with the 
restriction enzymes SalI and SspI. The fragment containing the mutated α1(I) procollagen 
cDNA is ligated into pSV-SPORT1 at the SalI and SnaBI sites, creating pSV-M α1(I). pM 
α2(I) is digested with SalI and SmaI. The fragment containing the mutated α2(I) procollagen 
is ligated into pSV-SPORT1 at the SalI and SnaBI sites, creating pSV-M α2(I). 

Either pSV-M α1(I) or pSV-M α2(I) is co-transfected with pRSV neo into HT1080 
cells (American Type Culture Collection, Rockville, MD) using Lipofectamine Reagent (Life 
Technologies, Gaithersburg, MD) and following the manufacturer's protocol for stable 
transfection of adherent cells. The mutated procollagen product from HT1080 cells 
transfected with either pSV-M α1(I) or pSV-M α2(I) is secreted into the medium as 
homotrimeric molecules. After 24 hours, the medium is collected and protease inhibitors are 
added to a final concentration of 10 mM N-ethylmaleimide, 1 mM PMSF, and 25 mM EDTA. 
Ammonium sulfate is added to the medium to a final concentration of 30%, and the medium 
is stored at 4°C for 16 hours. The medium is centrifuged at 4000 rpm for 10 minutes at 4°C. 
The pellet is resuspended in 50 mM Tris, pH 7.4, 150 mM NaCl, and protease inhibitors at 
the concentrations listed above, or in the appropriate buffer for cleavage of one or both of the 
propeptides from the collagen molecule. See the following sources for specific cleavage 
agents and reaction conditions: for enterokinase, Sigma Chemical Company, St. Louis, MO; 
for tryptophan chemical cleavage, Saito et al., J. Biochem. (1987) 101:123-134; for 
ubiquitin-specific processing enzyme, Tobias and Varshavsky, J. Biol. Chem. (1991) 266:12021; 
for Factor Xa, New England Biolabs, Beverly, MA. 
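
The volumes of inhibitor stock needed to reach the stated final concentrations can be worked out as follows. This is a sketch under stated assumptions: the stock concentrations and the 100 ml medium volume are hypothetical values chosen for illustration, and only the final concentrations come from the text. Because each addition increases the total volume, the total is solved first from V_total = V_medium / (1 - sum of C_final/C_stock).

```python
# Final concentrations from the text, in mM.
FINAL_MM = {"N-ethylmaleimide": 10.0, "PMSF": 1.0, "EDTA": 25.0}

# Hypothetical stock concentrations in mM (not from the specification).
STOCK_MM = {"N-ethylmaleimide": 1000.0, "PMSF": 100.0, "EDTA": 500.0}


def stock_volumes(medium_ml, final_mm, stock_mm):
    """Return ({inhibitor: ml of stock to add}, total volume in ml)."""
    fraction = sum(final_mm[name] / stock_mm[name] for name in final_mm)
    if fraction >= 1.0:
        raise ValueError("stocks are too dilute to reach the final concentrations")
    total_ml = medium_ml / (1.0 - fraction)
    volumes = {name: total_ml * final_mm[name] / stock_mm[name] for name in final_mm}
    return volumes, total_ml


MEDIUM_ML = 100.0  # hypothetical volume of collected medium
volumes, total_ml = stock_volumes(MEDIUM_ML, FINAL_MM, STOCK_MM)

for name, ml in volumes.items():
    print("%-18s %.2f ml of stock" % (name, ml))
print("total volume: %.2f ml" % total_ml)
```

Solving for the total volume first keeps every inhibitor at its target concentration after all additions, which a simple one-at-a-time C1V1 = C2V2 calculation would only approximate.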

Example 4 

(Use of Recombinant Human Collagen in Soft Tissue Augmentation) 
Recombinant human collagen produced as described in Example 3 is adjusted to a 
collagen concentration of 10 mg/ml by addition of 0.13 M phosphate buffer at pH 7.2 under 
conditions favoring fiber formation. The resulting suspension of fibrillar collagen is 
centrifuged at 10,000 x g to concentrate the collagen, which is subsequently rediluted to a 
concentration of 35 mg/ml by the addition of phosphate buffer. The resulting fibrillar 
collagen suspension is aliquoted into syringes for use in soft tissue augmentation. 

A physician attaches a 25-gauge or smaller needle to a syringe containing the 
recombinant human fibrillar collagen suspension prepared as described above. After 
attaching the needle to the collagen-containing syringe, the physician injects the collagen 
subcutaneously into the dermis, using multiple serial punctures, to soft tissue sites in need of 
augmentation, such as acne scars and wrinkles. Collagen is injected to a particular soft tissue 
site until the previously depressed area is even with the surface of the surrounding normal 
skin. 



Example 5 

(Cultureware Coating Assay: Human Dermal Fibroblast Contraction Assay) 
Recombinant human collagens are assayed and compared with purified human 
placental, bovine hide, and rat tail collagens for their ability to support gel contraction by 
human dermal fibroblasts. Recombinant human collagens are prepared as described in 
Example 3. Human collagen is purified from human placenta by extraction with 0.1 M acetic 
acid, followed by precipitation of type I collagen by the addition of 1 M NaCl. Bovine hide 
and rat tail collagens are commercially obtained in soluble form (e.g., Vitrogen). 

Primary human dermal fibroblasts obtained from American Type Culture 
Collection are suspended in DMEM containing 10% (v/v) fetal calf serum at 1.0-1.4 x 10^5 
cells/ml. One (1) milliliter of cell suspension and 2 ml of the same culture medium are added 
to 1 ml of recombinant collagen at a concentration of 2.5 mg/ml in 10 mM acetic acid. The 
cell suspension-collagen solution is poured into a 35-mm polystyrene petri dish (Costar) and 
incubated in a 37°C tissue culture incubator in 5% CO2 for 5 days to allow contraction to 
occur. 
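
The composition of the final mixture follows from the volumes given above. The sketch below assumes the volumes are simply additive (4 ml total) and that the cell suspension and the added medium both contain 10% serum, as the text indicates; variable names are illustrative.

```python
# Volumes and concentrations taken from the text.
CELL_SUSPENSION_ML = 1.0
MEDIUM_ML = 2.0
COLLAGEN_ML = 1.0
CELLS_PER_ML_RANGE = (1.0e5, 1.4e5)  # cells/ml in the starting suspension
COLLAGEN_MG_PER_ML = 2.5
ACETIC_ACID_MM = 10.0                # in the collagen solution
SERUM_PERCENT = 10.0                 # in both cell suspension and medium

# Assumption: volumes are additive.
total_ml = CELL_SUSPENSION_ML + MEDIUM_ML + COLLAGEN_ML

final_collagen = COLLAGEN_MG_PER_ML * COLLAGEN_ML / total_ml
final_cells = tuple(c * CELL_SUSPENSION_ML / total_ml for c in CELLS_PER_ML_RANGE)
final_acetic_acid = ACETIC_ACID_MM * COLLAGEN_ML / total_ml
final_serum = SERUM_PERCENT * (CELL_SUSPENSION_ML + MEDIUM_ML) / total_ml

print("total volume: %.1f ml" % total_ml)
print("collagen: %.3f mg/ml" % final_collagen)
print("cells: %.0f to %.0f per ml" % final_cells)
print("acetic acid: %.1f mM" % final_acetic_acid)
print("serum: %.1f %% (v/v)" % final_serum)
```

On these assumptions each dish holds collagen at 0.625 mg/ml, one quarter of the starting collagen concentration.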

Rates of contraction are measured each day by placing the culture dish containing the 
contracted cell matrix over a template containing circles of varying diameter, then estimating 
the diameter of the contracted cell matrix by comparison with the circles on the template. 
Recombinant human collagens prepared as described in Example 3 are shown to promote 
fibroblast contraction at rates at least comparable to those obtained using rat tail, purified 
human placental, and bovine hide collagens. 
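
One way to turn the daily diameter estimates into a contraction measure is to express the gel area as a fraction of its starting area. The sketch below is illustrative only: it assumes the gel initially fills the 35-mm dish and remains roughly circular, and the daily diameters are made-up values, not data from this specification.

```python
INITIAL_DIAMETER_MM = 35.0  # assumption: the gel starts at the dish diameter


def percent_contraction(diameter_mm, initial_mm=INITIAL_DIAMETER_MM):
    """Percent reduction in gel area relative to the starting area."""
    if diameter_mm < 0 or initial_mm <= 0:
        raise ValueError("diameters must be positive")
    return 100.0 * (1.0 - (diameter_mm / initial_mm) ** 2)


# Hypothetical diameters (mm) read off the template on days 1 to 5.
daily_diameters = [33.0, 30.0, 26.0, 22.0, 19.0]

for day, diameter in enumerate(daily_diameters, start=1):
    print("day %d: %.0f mm, %.1f%% contraction"
          % (day, diameter, percent_contraction(diameter)))
```

Because area scales with the square of the diameter, halving the diameter corresponds to a 75% reduction in area, so comparisons between collagen sources are easier to read on the area scale.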

All publications and patent applications cited in this specification are herein 
incorporated by reference, as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
foregoing invention has been described in some detail by way of illustration and example for 
purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in 
the art, in light of the teachings of this invention, that certain changes and modifications may 
be made thereto without departing from the spirit or scope of the appended claims. 
